Chinese Main Verb Identification: From Specification to Realization
Authors
Abstract
Main verb identification is the task of automatically identifying the predicate verb of a sentence. It is useful for many applications in Chinese natural language processing. Although most studies have focused on the model used to identify the main verb, the definition of the main verb should not be overlooked. While designing our specification, we found many complicated issues that remain unresolved because they have not been well discussed in previous work. Thus, the first novel aspect of our work is that we carefully design a specification for annotating the main verb and investigate various complicated cases. We hope this discussion will help to uncover the difficulties involved in this problem. Second, we present an approach to main verb identification based on chunk information, which leads to better results than the approach based on part-of-speech information. Finally, based on careful observation of the studied corpus, we propose new local and contextual features for main verb identification. We annotate a corpus according to our specification and then use a Support Vector Machine (SVM) to integrate all the proposed features. Our model, trained on our annotated corpus, achieved a promising F score of 92.8%. Furthermore, we show that main verb identification can improve the performance of the Chinese Sentence Breaker, one of its applications, by 2.4%.
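As a reading aid for the reported F score, the following is a minimal sketch of how precision, recall, and a balanced F score could be computed for main verb identification. It assumes one main verb index (or none) per sentence and the balanced F1 definition; the abstract does not state which F variant was used, and the function name `main_verb_f_score` and the data layout are hypothetical, not taken from the paper.

```python
from typing import Optional, Sequence, Tuple


def main_verb_f_score(
    gold: Sequence[Optional[int]],
    pred: Sequence[Optional[int]],
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for main verb identification.

    gold[i] and pred[i] are the token index of the main verb in sentence i,
    or None when no main verb is annotated or predicted.
    Assumption: balanced F1; the paper may use a different F variant.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must cover the same sentences")

    # A prediction is correct only if it names the same token as the annotation.
    correct = sum(1 for g, p in zip(gold, pred) if g is not None and g == p)
    n_pred = sum(1 for p in pred if p is not None)
    n_gold = sum(1 for g in gold if g is not None)

    precision = correct / n_pred if n_pred else 0.0
    recall = correct / n_gold if n_gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Toy data only, not from the paper's corpus.
    gold_verbs = [1, 2, 3, 4]
    predicted_verbs = [1, 2, 3, None]
    print(main_verb_f_score(gold_verbs, predicted_verbs))
```

In this toy run the model proposes three main verbs, all correct, but misses the fourth sentence, so precision is perfect while recall and the F score are lower.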
Similar resources
“Those Nation Wreckers are Suffering from Inferiority Complex”: The Depiction of Chinese Miners in the Ghanaian Press
This article studies the depiction of Chinese miners in the Ghanaian news website entitled Modern Ghana. A total of 87 articles comprising 43752 words were retrieved. Van Leeuwen’s (2008) theory of the representation of the social actors was utilised to examine the depiction of Chinese miners in the Ghanaian press. In this regard, six applicable tools were used and these include exclusion, role...
Lexicalization Typology of Realization Events in Mandarin Chinese
There has been a hot debate on the typological status of Mandarin Chinese in the Talmyan framework of Verb-framed languages (V-languages) and Satellite-framed languages (S-languages). However, most previous studies focus on motion events, while other macro-events (Talmy, 2000) receive little attention. The present study aims to investigate event of realization in Mandarin Chinese with experimental m...
Chinese Resultative Verb Compounds: Lexicalization and Grammaticalization
This paper is an historical study of the formation of the Chinese resultative verb compounds (RVCs) that signal a resultant state of a non-agent with a V1V2 predicate. Metaphorization and metonymization, understood within the theoretical framework of Brinton & Traugott (2005), are proposed to have played a most important role in the formation of the RVC in Middle Chinese. Many scholars noted (W...
A Constructional Approach to Argument Realization of Chinese Resultatives
This paper argues for a constructional view of the resultative constructions, specifically the resultative-verb compounds (henceforth RVCs). We claim that the effect of the construction must be taken into account in the realization of arguments, and the realization must be moderated by the linking rules. This paper is organized as follows: Section 2 provides a brief definition of resultatives i...
Motion events in Chinese novels: Evidence for an equipollently-framed language
Motion events typically involve an entity moving along a path in a certain manner. Research on language typology has identified three types of languages based on the characteristic expression of manner and path information. In satellite-framed languages, the main verb expresses information about manner of movement and a subordinate satellite element (e.g., a verb particle) to the verb conveys t...
Journal title:
- IJCLCLP
Volume 10, Issue
Pages -
Publication date 2005